Temporal difference models describe higher-order learning in humans

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Models to Describe Temporal Growth in Local Chickens of Ghana

The logistic, Gompertz, Richards and asymmetric logistic growth curve models were fitted to body weight data of local Ghanaian chickens and French SASSO T44 chickens. All four growth models provided good fit for each sex by genotype growth data with R2 values ranging from 86.7% to 96.7%. The rate constant parameter, k, ranged between 0.137 and 0.271 and were significantly different from zero fo...

متن کامل

The application of temporal difference learning in optimal diet models.

An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of Temporal difference learning motivated by growing evidence for neural correlates in natural reinforcement settings. The predator has the choice of including an aposematic prey in its diet or to forage on alternative food sources. We show h...

متن کامل

Difference Equations in Massive Higher Order Calculations

The calculation of massive 2–loop operator matrix elements, required for the higher order Wilson coefficients for heavy flavor production in deeply inelastic scattering, leads to new types of multiple infinite sums over harmonic sums and related functions, which depend on the Mellin parameter N. We report on the solution of these sums through higher order difference equations using the summatio...

متن کامل

Dual Temporal Difference Learning

Recently, researchers have investigated novel dual representations as a basis for dynamic programming and reinforcement learning algorithms. Although the convergence properties of classical dynamic programming algorithms have been established for dual representations, temporal difference learning algorithms have not yet been analyzed. In this paper, we study the convergence properties of tempor...

متن کامل

Preconditioned Temporal Difference Learning

LSTD is numerically instable for some ergodic Markov chains with preferred visits among some states over the remaining ones. Because the matrix that LSTD accumulates has large condition numbers. In this paper, we propose a variant of temporal difference learning with high data efficiency. A class of preconditioned temporal difference learning algorithms are also proposed to speed up the new met...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Nature

سال: 2004

ISSN: 0028-0836,1476-4687

DOI: 10.1038/nature02581